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1 TTTCTTCTGT TTGCTTACTC CCTATCCGGG GGCCCAAGGC GCTGTCTCCG 

51 CCGCCCAAGC CCC6CGTAAA CCTGGGTGAC CTCGGAGACA TCCGTTGGAG 

101 CATGAGTTCC CGACATCAGG CGGCGGCGGT GGTCCGGGAG AAACCCGGCG 

151 GCGGGGAGAT AAGCCTGCCC AGGAGGCAGG GGGCTGGGCT AGCTGCCCCG 

201 CCCCGCGCCT GACTTC6TTG G6GAGGGAGA CGCCCGGCTC CCGCCCCTAA 

251 CTAGCCCAGC CGCGCGGAGC GCCTGGGAGA GGAGAAGGAG CCGACCTGCC 

301 GAGATGGAGG CGACCG6CAC CTGGGCGCTG CTGCTGGCGC TGGCGCTGCT 

351 CCTGCTGCTG ACGCTGGCGC TGTCCG6GAC CAGGGCCCGA GGCCACCTGC 

401 CCCCCGGGCC CACGCCGCTA CCACTGCTGG GAAACCTCCT GCAGCTACGG 

451 CCCGGGGCGC TGTATTCAGG GCTCATGCGG CTGAGTAAGA AGTACGGACC 

501 GGTGTTCACC ATCTACCTGG 6ACCCTGGCG GCCTGTGGTG GTCCTGGTTG 

551 GGCAGGAGGC TGTGCGGGAG GCCCTGGGAG GTCAGGCT6A GGAGTTCAGC 

601 G6CCGGGGAA CCGTAGCGAT GCTGGAAG6G ACTTTTGATG GCCATGGGGT 

651 TTTCTTCTCC AACGGGGAGC GGTGGAGGCA GCTGAGGAAG TTTACCATGC 

701 TTGCTCTGCG GGACCTGGGC ATGGGGAAGC GAGAAGGCGA GGAGCT6ATC 

751 CAGGCGGAGG CCCGGTGTCT GGTGGAGACA TTCCAGGGGA CAGAAGGACG 

801 CCCATTCGAT CCCTCCCTGC TGCTGGCCCA GGCCACCTCC AACGTAGTCT 

851 GCTCCCTCCT CTTTGGCCTC CGCTTCTCCT ATGAGGATAA GGAGTTCCAG 

901 6CCGTGGTCC GGGCAGCTGG TGGTACCCTG CTGGGAGTCA GCTCCCAG6G 

951 GGGTCAGACC TACGAGAT6T TCTCCTGGTT CCTGCGGCCC CTGCCAGGCC 

1001 CCCACAAGCA GCTCCTCCAC CACGTCAGCA CCTTGGCTGC CTTCACAGTC 

1051 CGGCAGGTGC AGCAGCACCA G6GGAACCTG GATGCTTCGG GCCCCGCACG 

1101 TGACCTTGTC GATGCCTTCC TGCTGAA6AT GGCACAGGAG GAACAAAACC 

1151 CAGGCACAGA ATTCACCAAC AAGAACATGC TGATGACAGT CATTTATTTG 

1201 CTGTTTGCTG GGACGATGAC GGTCAGCACC AC6GTCGGCT ATACCCTCCT 

1251 GCTCCTGATG AAATACCCTC ATGTCCAAAA GTGGGTACGT GAG6AGCTGA 

1301 ATCGGGAGCT GG6GGCTGGC CAGGCACCAA GCCTAGGGGA CCGTACCCGC 

1351 CTCCCTTACA CCGACGCGGT TCTGCATGAG GCGCAGCGGC TGCTGGCGCT 

1401 6GT6CCCATG G6AATACCCC GCACCCTCAT GCGGACCACC CGCTTCCGA6 

1451 GGTACACCCT GCCCCAGGGC ACGGAGGTCT TCCCCCTCCT TGGCTCCATC 

1501 CTGCATGACC CCAACATCTT CAAGCACCCA GAAGAGTTCA ACCCAGACCG 

1551 TTTCCT6GAT GCA6ATGGAC GGTTCAGGAA 6CATGAGGCG TTCCTGCCCT 

1601 TCTCCTTAGG GAAGCGTGTC TGCCTTGGAG AGG6CCTGGC AAAAGCGGAG 

1651 CTCTTCCTCT TCTTCACCAC CATCCTACAA GCCTTCTCCC TGGAGAGCCC 

1701 GTGCCC6CCG GACACCCTGA GCCTCAAGCC CACCGTCAGT GGCCTTTTCA 

1751 ACATTCCCCC AGCCTTCCAG CTGCAAGTCC GTCCCACTGA CCTTCACTCC 

1801 ACCACGCAGA CCAGATGAAG GAAGGCAACT TGGAAGTGGT GGGTGCCCAG 

1851 GACGGTGCCT CCAGCCTCAA CAGTGGGCAT GGACAGGGTT AATGTCTCCA 

1901 GAGTGTACAC TGCAGGCAGC CACATTTACA CGCCTGCAGT TGTTrTCCGG 

1951 AGTCTGTCCC ACGGCCCACA CGCTCACTTG ACTCAT6CTG CTAAGATGCA 

2001 CAACCGCACA CCCATACACA ACTACAAGGG CCACAAAGCA ACTGCTGGGT 

2051 TAGCTTTCCA CAGACATAAA TATAGTCCAT CTGCAATCAC AAGCACATAG 

2101 CCAGGTAACC CACCAACTCC CCTGGATCTG CAGCCCACAC GTGGGAGTCT 

2151 GGCTGTCACC TTCACAAGCC ACAGAAACGG CCACACATGT TCACAGCTCA 

2201 CACGCCCTCT CCATTCATCG AACTTCTCAG TGTCCCTGTC CCTGGTGCCT 

2251 G6CACAGGGA ACAGCATGCC CCCTCCGGGG TCATGCCACC CAGAGACTGT 

2301 CGCTGTCTAT GGCCCCAACT CATGCTCCCT CTCTTGGCTA CACCACTCTC 

2351 CCAGCCTGTG ACCACCGATG TCCACACACC CCCAACCACT TGTCCACACA 

2401 GCTACCCACG TACGACATCG TCCTGGCTCC CCAGAGTATC TTCCCACTGA 

2451 GACACGCCGC CCCCACAGAG GCACAGTCCC CAGCCACCTC TGCAACTGCA 

2501 GCCCTCAGTC ACCCCTTTTT AAGCACCCTG ATTCTACCAA ATGCAAACAC 

2551 ATCTG6GTCT GC6ATTATGC ACA6AGACTT TGGACATACG AG6ACCCTCA 

2601 GACCGGAG6A ACACCTGCCC AACCCCAACA CGTGCTTATG TAACCACGTG 

2651 GAAAGCGGCC CCTGCTGCCC CTCCACACAC ACATACACAC TCACTGATCT 

2701 ACAGCCCCTG TTCGGCGTCA GAGTCCCCAC TAGACCCAGT GGAAGGGGTT 

2751 AGAGACCAAG TAGG6GCCAG TTTCCAATTC ACCCTGTCAG GGAGTGAGCC 

2801 GGAT CTGACG TTCCTTGTGA CTTAAGGGTC CGGCTTGG6A ATTAAAGTTT 

2851 GTTTCTGGCC TTTAGCCTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

2901 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA (SEQ ID N0:1) 

FEATURES: 
5'UTR: 1-303 
Start Codon:304 
Stop codon: 1815 
3'UTR: 1818 
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HOMOLOGOUS PROTEINS 
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117254 sp|P24461|CPG]_RABIT CYTOCHROME P450 2Gl (CYPIIGl) (P. 
404777 gb|AAA31432,l| (L10912) cytochrome P-450 2B-BX [Oryct. 
479930 pir||s35666 cytochrome P450 284 isoform Bx - rabbit >. 
117212 sp|P00178|CPB4„RABIT CYTOCHROME P450 2B4 (CYPIIB4) (P. 
320075 pir |S31277 cytochrome P450 2B4-B1 - rabbit >gi 121369. 
4731350|gb AAD28466.il (AF128849) cytochrome P450 2b10 relat. 
3123191 isp P04167|CPB2_RAT CYTOCHROME P450 2b2 (CYPIIBZ) (P4. 
117213|sp|Pl2789|CPB5^RABIT CYTOCHROME P450 2B5 (CYPIIBS) (P. 
89973|pir IA27717 cytochrome P450 2b5, hepatic (form HPl) - . 
2144292|pir| 104RTP2 cytochrome P450 2b2 - rat 



gFTBEl48597 
■ BF359243 
AW753778 



gb 



BE148597 MR0-HT0241-150500-010-b02 HT0241 Homo sapi . 
BF359243 RC6-ET0072'150600-011-f01 ET0072 Homo sapi. 
AW753778 RC1-CT0286- 301099-011- f 08 CT0286 Homo sapi. 



EXPRESSION INFORMATION FOR MODULATORY USE: 
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Tissue expression: 
HumarTi eukocyte 
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1 MEATG7WALL LALALLLLLT LALSGTRARG HLPPGPTPLP LLGNLLQLRP 

51 GALYSGLMRL SKKYGPVFTI YLGPWRPWV LVGQEAVREA LGGQAEEFSG 

101 RGTVAMLE6T FDGHGVFFSN GERWRQLRKF TMLALRDLGM GKREGEELIQ 

151 AEARCLVETF QGTEGRPFDP SLLLAQATSN WCSLLFGLR FSYEDKEFQA 

201 WRAAGGTLL GVSSQGGQTY EMFSWFLRPL PGPHKQLLHH V5TLAAFTVR 

251 QVQQHQGNLD ASGPARDLVD AFLLKMAQEE QNPGTEFTNK NMLMTVIYLL 

301 FAGTM7VSTT VGYTLLLLMK YPHVQKWVRE ELNRELGA6Q APSLGDRTRL 

351 PYTDAVLHEA QRLLALVPMG IPRTLMRTTR FRGYTLPQGT EVFPLLGSIL 

401 HDPNIFKHPE EFNPDRFLDA DGRFRKHEAF LPFSLGKRVC LGEGLAKAEL 

451 FLFFTTILQA FSLESPCPPD TLSLKPTVSG LFNIPPAFQL QVRPTDLHST 

501 TQTR (SEQ ID NO: 2) 



FEATURES: 

Functional domains and key regions: 

Prosite search results: 

[1] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

CAMP- and cGMP-dependent protein kinase phosphorylation site 
128-131 RKFT 



[2] PDOC00005 PS00005 PKC_PH0SPH03ITE 
Protein kinase c phosphorylation site 

Number of matches: 6 

1 61-63 SKK 

2 99-101 SGR 

3 248-250 TVR 

4 288-290 TNK 

5 378-380 TTR 

6 473-475 SLK 



[3] PDOC00006 PS00006 CK2_PH0SPH0_SITE 
Casein kinase ii phosphorylation site 

Number of matches: 3 

1 119-122 SNGE 

2 192-195 SYED 

3 343-346 SLGD 



[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 10 

1 51-56 GALYSG 

2 109-114 6TFDGH 

3 115-120 GVFFSN 

4 188-193 GLRFSY 

5 207-212 GTLLGV 

6 257-262 GNLDAS 

7 284-289 GTEFTN 

8 339-344 GQAPSL 

9 370-375 GIPRTL 
10 444-449 GLAKAE 



[5J PDOC00009 PS00009 AMIDATION 
Amidation site 

Number of matches: 2 

1 140-143 MGKR 

2 435-438 LGKR 



[6] PDOC00029 PS00029 LEUCINEJIPPER 
Leucine zipper pattern 

Number of matches: 2 

1 32-53 LPPGPTPLPLLGNLLQLRPGAL 

2 39-60 LPLLGNLLQLRPGALYSGLMRL 
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[7] PDOC00081 PS00086 CYTOCHROM£_P450 

cytochrome P450 cysteine heme-iron ligand signature 

433-442 FSLGKRVCL6 

Membrane spanning structure and domains: 
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BLAST Alignment t T p Hit: 

>gi|117254|sp|P24461|CPGl_RABIT CYTOCHROME P450 2G1 (CYPIIGl) 
CP450-NMB) (OLFACTIVE) 

pir||B31944 cytochrome P450 2G1 - rabbit 
Length == 494 

Score * 516 bits (1315), Expect = e-145 

Identities = 248/491 (50^), Positives = 345/491 (6936), Gaps - 3/491 (036) 
Query: 1 meatgtwalllalal-lllltlalsgtrarghlppgptplpllgnllqlrpgalysglmr 59 

ME G + + LAL LL+ +A + G LPPGPTP+P LGNLLQ+R A + ++ 

sbjct: 1 melggaftiflalcfsclliliawkrvqkpgrlppgptpipflgnllqvrtdatfqsflk 60 
Query: 60 lskkygpvftiylgpwrpvwlvgqeavrealggqaeefsgrgtvamlegtfdghgvffs 119 

L +KYGPVFT+Y+GP RPW+L G EAV+EAL +A+EFSGRG +A +E F GHGV + 

Sbjct: 61 lrekygpvftvymgp-rpwilcgheavkealvdradefsgrgelasvernfqghgvala 119 
Query: 120 ngerwrqlrkftmlalrdlgmgkregeeliqaearclvetfqgtegrpfdpslllaqats 179 

NGERWR LR+F++ LRD GMGKR EE IQ EA L+E F+ T+G P DP+ L++ S 

Sbjct: 120 ngerwrilrrfsltilrdfgmgkrsieeriqeeagylleefrktkgapidptfflsrtvs 179 
Query: 180 nwcsllfglrfsyedkefqawraaggtllgvssqggqtyemfswflrplpgphkqllh 239 

NV+ S-H-FG RF YEDK+F +++R + + +S+ Q Y+M+S ++ LPG H ++ + 

sbjct: 180 nvisswfgsrfdyedkqflsllrminesfiemstpwaqlydmysgvmqylpgrhnriyy 239 
Query: 240 hvstlaaftvrqvqqhqgnldasgpardlvdafllkmaqeeqnpgteftnknmlmtviyl 299 

+ L F +V+ ++ +LD P RD +D FL+KM Q++ NP TEF KN+++T + L 

Sbjct: 240 lieelkdfiaarvkvneasldpqnp-rdfidcflikmhqdknnphtefnlknlvlttlnl 298 
Query: 300 lfagtmtvsttvgytllllmkyphvqkwvreelnrelgagqapslgdrtrlpytdavlhe 359 

FACT TVS+T+ Y LL+MK+P VQ + EE+N+ +G + PS+ DR ++P+TDAV+HE 
Sbjct: 299 FFAGTETVSSTLRY6FLLIMKHPEVQTKIYEEINQVIGPHRIPSVDDRVKMPFTDAVIHE 358 

Query: 360 aqrllalvpmgiprtlmrttrfrgytlpqgtevfpllgsilhdpnifkhpeefnpdrfld 419 

QRL +VPMG+P ++R T FRGY LP+GT+VFPLLGS+L DP F HP++F P FLD 
Sbjct: 359 IQRLTDIVPMGVPHNVIRDTHFRGYLLPKGTDVFPLLGSVLKDPKYFCHPDDFYPQHFLD 418 

Query; 420 ADGRFRKHEAFLPFSLGKRVCLGEGLAKAELFLFFTTILQAFSLESPCPPDTLSLKPTVS 479 

GRF+K+EAF+PFS GKR+CLGE +A+ ELFL+FT+ILQ FSL PP + + P +S 
Sbjct: 419 EQGRFKKNEAFVPFSSGKRICLGEAMARMELFLYFTSILQNFSLHPLVPPVNIDITPKIS 478 

Query: 480 GLFNIPPAFQL 490 

G NIPP ++L 
Sbjct: 479 GFGNIPPTYEL 489 (SEQ ID NO: 4) 

Hmmer search results (Pfam): 

Scores for sequence family classification (score includes all domains): 
Model Description score E-vaiue N 



PF00067 cytochrome P450 594.4 6.9e-175 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF00067 ' 1/1 33 493 1 497 [] 594.4 6.9e-i75 
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1 NNNNNNNNNN NNNNNNNNNN 

51 NNNNNNNNNN NNNNNNNNNN 

101 NNNNNNNNNN NNNNNNNNNN 

151 NNNNNNNNNN NNNNNNNNNN 

201 NNNNNNNNNN NNNNNNNNNN 

251 NNNNNNNNNN NNNNNNNNNN 

301 NNNNNNNNNN NNNNNNNNNN 

351 NNNNNNNNNN NNNNNNNNNN 

401 NNNNNNNNNN NNNNNNNNNN 

451 NNNNNNNNNN NNNNNNNNNN 

501 NNNNNNNNNN NNNNNNNNNN 

551 NNNNNNNNNN NNNNNNNNNN 

601 NNNNNNNNNN NNNNNNNNNN 

651 NNNNNNNNNN NNNNNNNNNN 

701 NNNNNNNNNN NNNNNNNNNN 

751 NNNNNNNNNN NNNNNNNNNN 

801 NNNNNNNNNN NNNNNNNNNN 

851 NNNNNNNNNN NNNNNNNNNN 

901 NNNNNNNNNN NNNNNNNNNN 

951 NNNNNNNNNN NNNNNNNNNN 

1001 NNNNNNNNNN NNNNNNNNNN 

1051 NNNNNNNNNN NNNNNNNNNN 

1101 NNNNNNNNNN NNNNNNNNNN 

1151 NNNNNNNNNN NNNNNNNNNN 

1201 NNNNNNNNNN NNNNNNNNNN 

1251 NNNNNNNNNN NNNNNNNNNN 

1301 NNNNNNNNNN NNNNNNNNNN 

1351 NNNNNNNNNN NNNNNNNNNN 

1401 NNNNNNNNNN NNNNNNNNNN 

1451 NNNNNNNNNN NNNNNNNNNN 

1501 NNNNNNNNNN NNNNNNNNNN 

1551 NNNNNNNNNN NNNNNNNNNN 

1601 NNNNNNNNNN NNNNNNNNNN 

1651 NNNNNNNNNN NNNNNNNNNN 

1701 NNNNNNNNNN NNNNNNNNNN 

1751 NNNNNNNNNN NNNNNNNNNN 

1801 NNNNNNNNNN NNNNNNNNNN 

1851 NNNNNNNNNN NNNNNNNNNN 

1901 NNNNNNNNNN NNNNNNNNNN 

1951 TTGGATCGAA GAGGTCACAG 

2001 GCTGAGTAAG AAGTACGGAC 

2051 GGCCT6TGGT GGTCCTGGTT 

2101 GGTCAGGCTG AGGAGTTCAG 

2151 GACTTTTGAT GGCCATGGTA 

2201 GCCTGCCACC ACTTACT6GT 

2251 TGTTGCCTCA TCTGTCAAAT 

2301 CAGTGGCTAG TGCCTGAAAT 

2351 ATCACTTGAG GTCAGGAGTT 

2401 CTGTCTCTAC TAAAAATATA 

2451 GTAATCCCAG ATACTTGGGA 

2501 GAGGCAGATG TTGCAGTGAA 

2551 GCAACAGAGT 6AGCCTCCAT 

2601 TCATGTAGGA TTGAGTGATT 

2651 CACTCACTAA TCACCAGGTT 

2701 GGTAAAGCCC TGAGGGTTCA 

2751 GGGATTGGTG GGCAAGCCCT 

2801 ACAGAAAGCA GGAGGCCCAT 

2851 TAAGACAGCA CCGGTGCTGC 

2901 CAGACTCACC ACACAGTCCC 

2951 AGCACGGGGA GAAAGGTCAG 

3001 GGTCAGGGCC AGGCTGAGGG 

3051 GTACCTGACC CGGCATGTTT 

3101 AGAGAGGCTG TCGGAGCTGA 

3151 AGGCAGAAAG ACCAGCACCT 

3201 TCCATTCAAC TGATGAATTT 

3251 ATCCCAGCAC TTTGGAA6GC 

3301 TTTGTGACAA GCCTGGGCAA 

3351 CAAACAAACA AAAAATCATT 

3401 GAAATGACTC AGGCAGATAT 

3451 CTTATACTGC AGTAAGACAA 

3501 AGGTACAGAT TTGAGAGCAA 

3551 AAGAGGGAAT CAG6GAAGGC 

3601 CATGAAGGAT GAACGAGTTA 

3651 GAGTGATGGA GGCAGAGGGA 

3701 CCTGACGCCC TTGAGGAAGT 

3751 AAGTAACAAA GCTGAGAAGC 

3801 GAGTCACGTC A6AGTGTTTG 

3851 TTAAGCA66G AACAGCTGTA 



NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNTGACA GGGGCCATGA TGGAGACACC 
CACCCTCCTC TTTCTTCCTC CCTACCCCCA 
CGGTGTTCAC CATCTACCTG GGACCCTGGC 
GGGCAGGA6G CTGT6CGGGA GGCCCTGGGA 
CGGCCGGGGA ACCGTAGCGA TGCTGGAAGG 
AGTCAAGGGC TGCTAGGCCC TCCGCTCACA 
GTGTGACCTT T6CACATGGC TTAGTCCCTC 
GGAGTGATAA CAGTGCCCAT CAGCCGGGTG 
CCCAACACTT TGG6AGGCGG AGGTGGGTGG 
CGAGACCAGC CTGGCCAACA T6GTGAAACC 
AAAATTAGCT GGGCATGGTG GTGCGTACCT 
GGTTGAGGCA 6GAGAATCGC TTGAACCCGG 
CCAAGACTGT GCCACTGCAC TCCAGTCTGG 
CTCAAACAAA CAAACAAAAA GCAGTGCCCA 
GAGTGAG6AC TGAGCCTTGT GCAAAGTGAG 
GTAGTATCAG TGATAACCAT CAATGATCCA 
GAAAGATGCC GGAGCGCTTT CAAGGTGCTG 
CGAATAATAG AAACAGTTCT CTGTATTACA 
GCTGGGTGCT GCCAGGAACT CAGTAGTAAC 
TTCCCCAGCG CACCTAGGCC AGTGGGGAAA 
AGCCCAGAGT GGTCAGGGCC AAGATGGGGA 
GGTGGGATGG GGAGGGGTCA GGGCAAGAGG 
AAGCCCTGGG ACTGTAGGAA TTTAGAGGAG 
GGTGAGGGA6 ATTCAGGAAG TCTTCCTGGA 
GACTCATAAG ATGAGTGGGG AGGGTGTTCC 
ACAAAAGCAT GACTTTGAGA GAAGCATTCA 
TCAGACTGGG CACGCTGGCT CATGCCTGTA 
TGAATG6G6A GGATGACTTG A6CCTA6GCA 
CATGGTGA6A CCCTGCCTCC ACAAAACAAA 
ATACCTGGTA CCATGGGTAC CAGGTACATA 
GGTGTCCTCT CCTACTGTGG GAGAGGCGGG 
TAGAGGGAGG GAATATAATC CTAAAATGAG 
ACACAGGGCA CAGGCATATG TACGAGGGTA 
TTCTCAGAGA AGGTGACATT TAAGCCGGGA 
GTTCACCAAG GATGGGATGG AAAGGGGTGA 
ACTGCAGGAT CATAGGCCTA GACAGGG6AT 
GAGAGAAGAC CAGCGCAGTC GTAGTGGGTT 
CAGGGAAATC CCTGGTCATG CAGGGCCTGT 
GGCTTTTGTT TTTCCTG6GA GCAGTCGATT 
TTCAGAGTT6 GGAAGATCCT GTGGTTGCTG 
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3901 CCTGAAGG6G AT6AAACTGG AGGCTAGGAG CCCAGGGTGA TAGGGAG6AT 

3951 CCAGGGTGAT GGGGAGGCTG 66AGGTCCGC G6TGATGGAC CA6GGCTGGG 

4001 GCCAGGGGAT GG6GAGGAAG GAGTAATT6G GAGAGGCCTG GGGCTCTGGC 

4051 CGAGGAATGG ATGGTGGGCT GAAACA6GGA GAGGAGAGAT GCTTAGGCCA 

4101 CTTTGGAACA CAGTAGGGCA AGGACAG6AG ACACCCAAG6 GGAA6TGCCC 

4151 AAGAGACCAC GACA66CTGG CATTGGACAG G6AAGGTCTG TCTGGAGCAG 

4201 GTGTCTTGGA TAA6GGAGGA AAATGGTGCA GTTCCATCCT CCTCCCTCTC 

4251 TGTTCAACCT CTAAACTACA TGGGGCACAG GACCCAGTGG GACTCCATAA 

4301 ATGATGGGAT GGGTG6ATGG AAGGAAGGAA GGAGGAAACA ACTCTTCATT 

4351 CATCCTGGTT ATTTACAGAA CAGGCCAGGT GCGGTGCTCA CGCTTGCCAT 

4401 TCTAGCACTT TGGGAGGCTG AGGTGGGTGG ATTACCTCAG GTCAGGAGTT 

4451 CAAGACCAGC CTAGACAACG TAGA6AAACC CCATCTCTAC TGAAGATATA 

4501 AAATTAGCTG GGCGTAGTGG CATATGCCTG TAATCCCAGC TAGTCGGGAA 

4551 GCTGAGGCAG GAGAATCGCT TGAACCCGAG AGGCAGAGGT TGCGGTGAGC 

4601 TGAGATC6TG CCATTGCACT CCAGCCTGGG T6ACAAA6CA AGACCTCGTC 

4651 TCAATAATAA TAATAATTAC AAAACAGAAG GA6CCTGGGT CATCCCAGCT 

4701 ACCTACTTTT CAGGAGAATG TACTCCCTTA CCCAAGGGCA AAGGATGGGA 

4751 GAACCAGTTT GATTATGCAT TTATTGAGCA CCTACTGAGT CCTCATCCCT 

4801 GGGCTAGGCT GGAATGGACT CAGATGGAGC CTGAAGAGTC CCCCTCAGGG 

4851 AACCTCACTA GAAAGAAGGA GGAATCGGCC GGGCGCGGTG GCTCACGCCT 

4901 GTAATCCCAA CACTTTGGGA GGCTGAGGTG GGTGGATCAC AAGGTCAGGA 

4951 6ATCGAGACC ATCCTGGCTA ACACAGTGAA ACCCCATCTC TACTAAAAAT 

5001 ACAAAAAATG AGCCAGGCAT GGTGGCGGGC GCCTGTAGTC CCAGCTACTC 

5051 AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCA GAGGTTGCAG 

5101 TGAGACGA6A TCACGCCACT GCACTCCAGC CTGGGCAACA GA6CGAGATT 

5151 CCGTCTCAAA AAAAAAAAGA AAGAAAGGAA GAAGGGGGAA TGGGGGAGAG 

5201 GGGCCGGTCC CI I I I TGAGT CTAGCCTTCT GCGCAGGGGT TTTCTTCTCC 

5251 AACGGGGAGC GGT6GAGGCA GCTGAGGAAG TTTACCATGC TT6CTCTGCG 

5301 GGACCTGGGC ATGGGGAAGC GAGAAGGCGA GGAGCTGATC CAGGCGGAGG 

5351 CCCGGTGTCT G6TG6AGACA TTCCAGGGGA CAGAAGGTCA GCATGGCGGG 

5401 GTCACCCCAG GGTCTCCAGC CGAGTGAAAG GGAAAACTCT CCTACTGTGG 

5451 CTGGGGGTGG CCCCAACCCA GGTCCTGGAA TGGGCAGGAG GGGAAGCCTT 

5501 GAACTCTAGG GCTGGCCTGG GGGTTCTGTT CACTGCCACC TTCTGTCTCT 

5551 GTCCCACTGT CTCTCCGAGG CTGTCATGAC ATCTCTCTGT GTGTCTCTGG 

5601 TGCTATCATC CCATTCTTCC T GGGTCT CCA TCTCTCTCTC TCTCTCTTTT 

5651 CTTTCTCTCT CCTTTCCTCT ATTTTTTGGG CCCTCAGTCT ATCTCTGTTT 

5701 CTGTCTCCCT GTCTGTGTGA TGGTCACTCT GTTTCTTTCT CCCTGTCTGT 

5751 TTCTCTGTCC CTATCT6TCT GTATCCTTCT TTGCCTGTTT AGCTCTCTCC 

5801 CTGCGCTGTC CATCCATCTT TCCCTGCCTC CCTGTCTCTC TCTGGTTGGG 

5851 TTCAGCCCCA ACCTGCTCCC CTCTGCCTGG CTCCATCACA GCCTACCTCC 

5901 CTGCCCCCAT TCCCCCCAGG ACGCCCATTC GATCCCTCCC TGCTGCTGGC 

5951 CCAGGCCACC TCCAACGTAG TCTGCTCCCT CCTCTTTGGC CTCCGCTTCT 

6001 CCTATGAGGA TAAGGAGTTC CAGGCCGTGG TCCGGGCAGC TGGTGGTACC 

6051 CTGCTGGGAG TCAGCTCCCA GGGGGGTCAG GTGAGTGGGT GGGACCCCTC 

6101 TCCAACTACC TTCCCTGAAG GTTCCTGCCA AGGTCCCATG AGAACTAGCT 

6151 GCCCTTCTCC CCACAGACCT ACGAGATGTT CTCCTGGTTC CTGCGGCCCC 

6201 TGCCAGGCCC CCACAAGCAG CTCCTCCACC ACGTCAGCAC CTTGGCTGCC 

6251 TTCACAGTCC GGCAGGTGCA GCAGCACCAG GGGAACCTGG ATGCTTCGGG 

6301 CCCCGCACGT GACCTTGTCG ATGCCTTCCT GCTGAAGATG GCACAGGTGT 

6351 GGGAAGGGTG CAGGGACCCC CTCTCTGAAT GGGCGTGGTG ACCTGGCAGG 

6401 TCCCA6CCAG GTGTCCCTGG GGACCTCAAT TGGGTTCCTC TCTCTTTCTC 

6451 TCTCTGCATG TCTCTGTGAG TATGAGTGTC TCTGTGCATG TGTGTGCATC 

6501 CCTTCrCTGC ACATCTGTGC TGGCCCTTTC AGGGC6TTGC TCTCACTGCC 

6551 TCTCCCGCCC CCGACCTGGG CATTTGTGCC GG6CTGTCTG TCTCTCCAGC 

6601 ATCTCTCCTC TTTCTCCCTC CCACCTCGGC CCTTGTGTTC AGGCCCCATG 

6651 CCCAGGGTCC TACACCAGCA ATCCCCAGGA TCACTTCATC CCATCCCCTG 

6701 CAGCCTCCCC AGACTTTTAT GTAAATTCAC AATTTTATGT GAATTATGGT 

6751 CATTTATTAG GAAGCCTTGC AATATCAAGT TATGTTAATA AAGTCCACTT 

6801 TATTAATTAT ATAAGAACAA TATTTCTTTT CCTTTTTTTT TT C II I I CM 

6851 TTTAAAGAGA CAGGATCTCT TTCTGTTGCC CAGGCTAGAG TACAGTTGCA 

6901 AAATCATAGC TCACTGCAAC CTT6AACTCC TGGGCTCAA6 CAATCCTCCT 

6951 GCCTCGGGCT CCTGAGTA GC TGGGACAACA GGTGTGCACC ACCACACCTG 

7001 GCTAAATTTT TTTTTTTTCT TTGTAGAGAT AGACTCTCAC TATGTTACCC 

7051 AGGCTGGTCT TGAATTCCTG GGCTCATGTA ATCCTCCTGC TGCCTTGAAC 

7101 TCCCAAAGTG CTGGGACTAT AGGCATAAGA CATCATGCCC GGTCGGGCAC 

7151 AGTGGCTCAT GCCTGTAATC TCAGGACTTT GGGAGGCCGA GACGGGCGGA 

7201 TCACCTGA6G TC6GGAGTTC GAGACCAGCC TGACCAACAT GGAGAAACCC 

7251 CATCTCTACT AAAAAAAAAA ATACAAAATT AGCCGGACGT GGTGGCACAT 

7301 GCCTGTAATC CCAGCTACTA GGGAGGCTGA GGCAGGAGAA TCGCTTGAAC 

7351 CCGGGAGGCT TAGGTTGCGG TGAGCTGAGA TTGCACCATT GCACTCCAGC 

7401 CTGGGCAACA AGAGCGAAAT TCCATCTCAA AAAAAAAAAA AAAGAAAAAA 

7451 AGAAAAAAGA CACCATGCCC TATAAGTAAA CTAGAATTAA GGTGACTCCT 

7501 AAGGAAATAA ATAGI III lA ACTGTACGAA CTTTTGGAAG AATGGGGCCA 

7551 ATTCTTTAAT TAAATGCAGC CTCCCTGTTT GTGGAGAAAG AAAAATTTTT 

7601 CTTAACCCTA TTGCCCCATT TCTTTTCTCT TTTATTGAAT A ll I I I lA GT 

7651 TTTAACTATA GTAAAATACA CATAACGTTT ACCATCTTAA CCATTTTTAG 

7701 6TATACAGTA CA6TAGTGTT CA6TACATTC ATACTGTTAT GCAATCAGTC 

7751 TCCAGAACTC TTCATGTTGC AAAGCTGAAA CTCTATACCC ATTAAACAAC 
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7801 TGCCTGTTCC TCCCTCCTCC 

7851 GACGAAGTCT CACTCTGTCA 

7901 GCTCACTGCA AGCTCCGCCT 

7951 CTCCCAAGCA GCTGGGACTA 

8001 TTTGTATTTT TAGTAGAGAT 

8051 CAAACTCCTG GCCTCAAGTG 

8101 GGACTACAGG CGTGAGCCAC 

8151 TTAGTTCTCT CTGGTT6AAA 

8201 AACAAAAACA CAGAAAGTTA 

8251 AACCCATGTA TACCATTGGT 

8301 AATGGTACAG CAGTTACCTG 

8351 CATGCAGAAA CCCCGTCTCC 

8401 NNNNNNNNNN NNNNNNNNNN 

8451 NNNNNNNNNN NNNNNNNNNN 

8501 NNNNNNNNNN NNNNNNNNNN 

8551 NNNNNNNNNN NNNNNNNNNN 

8601 NNNNNNNNNN NNNNNNNNNN 

8651 NNNNNNNNNN NNNNNNNNNN 

8701 NNNNNNNNNN NNNNNNNNNN 

8751 NNNNNNNNNN NNNNNNNNNN 

8801 NNNNNNNNNN NNNNNNNNNN 

8851 NNNNNNNNNN NNNNNNNNNN 

8901 NNNNNNNNNN NNNNNNNNNN 

8951 NNNNNNNNNN NNNNNNNNNN 

9001 NNNNNNNNNN NNNNNNNNNN 

9051 NNNNNNNNNN NNNNNNNNNN 

9101 NNNNNNNNNN NNNNNNNNNN 

9151 NNNNNNNNNN NNNNNNNNNN 

9201 NNNNNNNNNN NNNNNNNNNN 

9251 NNNNNNNNNN NNNNNNNNNN 

9301 NNNNNNNNNN NNNNNNNNNN 

9351 NNNNNNNNNN NNNNNNNNNN 

9401 NNNNNNNNNN NNNNNNNNNN 

9451 NNNNNNNNNN NNNNNNNNNN 

9501 NNNNNNNNNN NNNNNNNNNN 

9551 NNNNNNNNNN NNNNNNNNNN 

9601 NNNNNNNNNN NNNNNNNNNN 

9651 NNNNNNNNNN NNNNNNNNNN 

9701 NNNNNNNNNN NNNNNNNNNN 

9751 NNNNNNNNNN NNNNNNNNNN 

9801 NNNNNNNNNN NNNNNNNNNN 

9851 NNNNNNNNNN NNNNNNNNNN 

9901 NNNNNNNNNN NNNNNNNNNN 

9951 NNNNNNNNNN NNNNNNNNNN 

10001 NNNNNNNNNN NNNNNNNNNN 

10051 NNNNNNNNNN NNNNNNNNNN 

10101 NNNNNNNNNN NNNNNNNNNN 

10151 NNNNNNNNNN NNNNNNNNNN 

10201 NNNNNNNNNN NNNNNNNNNN 

10251 NNNNNNNNNN NNNNNNNNNN 

10301 NNNNNNNNNN NNNNNNNNNN 

10351 NNNNNNNNNN NNNNNNNNNN 

10401 NNNNNNNNNN NNNNNNNNNN 

10451 NNNNNNNNNN NNNNNNNNNN 

10501 NNNNNNNNNN NNNNNNNNNN 

10551 NNNNNNNNNN NNNNNNNNNN 

10601 NNNNNNNNNN NNNNNNNNNN 

10651 NNNNNNNNNN NNNNNNNNNN 

10701 NNNNNNNNNN NNNNNNNNNN 

10751 NNNNNNNNNN NNNNNNNNNN 

10801 NNNNNNNNNN NNNNNNNNNN 

10851 NNNNNNNNNN NNNNNNNNNN 

10901 NNNNNNNNNN NNNNNNNNNN 

10951 NNNNNNNNNN NNNNNNNNNN 

11001 NNNNNNNNNN NNNNNNNNNN 

11051 NNNNNNNNNN NNNNNNNNNN 

11101 NNNNNNNNNN NNNNNNNNNN 

11151 NNNNNNNNNN NNNNNNNNNN 

11201 NNNNNNNNNN NNNNNNNNNN 

11251 NNNNNNNNNN NNNNNNNNNN 

11301 NNNNNNNNNN NNNNNNNNNN 

11351 NNNNNNNNNN NNNNNNNNNN 

11401 NNNNNNNNNN NNNNNNNNNN 

11451 NNNNNNNNNN NNNNNNNNNN 

11501 NNNNNNNNNN NNNNNNNNNN 

11551 NNNNNNNNNN NNNNNNNNNN 

11601 NNNNNNNNNN NNNNNNNNNN 

11651 NNNNNNNNNN NNNNNNNNNN 



AACCCCTGGC AATCACCTTT 111 III I I GA 
CCCAGGCTAG AGTGCGGTGG CTCGATCTCG 
CCCGGGTTCA TGCCATTCTC CTGCCTCAGT 
CAGGTGCCCG TCACCACGCC TGGCTAATTT 
GGA6TTTCAT CGTGTTAGCC AGGCTGATCT 
ATCCACCCGC CTCGGCCTCC CAAA6TGCTG 
TGTGCCTGGC CAGGAAGTAG ACTCTTGATA 
T6TTTTTAAA AATGAAAGAG AATGACTAAT 
TAAGGATTGA TGAAGATGTG GAGACTTTGA 
GGGAATGTGA AACGACGCAG CCCTGTGGAA 
AGGTCAGGAG TTTGAAACCA ACCTGGCCAA 
ATTAAATGTA CAAAAATTAG CCAGGNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNCTCT TTCTTCCTTT CTTCCTTCCT 
CCCTTCCTCC CTTCCTCCCT TCCTGTCTTC CTCTCTTTCT CTCTTTCTTT 
CTT6ACAGGG TCTCTCTTTG TCTCCCA6GC TGGG6TGCAG TGGTACAAGC 
ATAGCTCACA 6CAGCCTTGA ACTCCTAGGC TCAAGT6ATC CTCCCACGTC 
AGCCTCCTGA GCAGCTGGGA CAACGGGCTC ATACCACCAT GCCT6GCTAA 
TTTTTTAATT TTTCGTAGAG ACAAG6TCTT 6TTATATTGC CCAGGCTGGT 
CTCAAACTCC TGGGCTCAAA TGCTTCTCTC ACCTCAGCCT CCCACGTGGC 
TG6GATTACA GGCATGAGCC ACTGCACGCC ACTCAACACT CCACAAATGT 
TGATGCCATT AT6TTTTGTG AACTAGTGTC CCTGGCACCC GAGACTTGTA 
CTCCACACTC GAGGACCAAA TAGACTGGGG TGGGAAGGGG TTTATAGTTT 
CATTATTATT TCCCCTCAGG GCACGGAGGT CTTCCCCCTC CTT6GCTCCA 
TCCTGCATGA CCCCAACATC TTCAAGCACC CAGAAGAGTT CAACCCAGAC 
CGTTTCCTGG ATGCAGATGG ACGGTTCAGG AAGCATGAGG CGTTCCTGCC 
CTTCTCCTTA GGTATCTGCT GCAGCCCTGG GTATCACAAG CAGGTGCTGG 
CGAACTCCAG GCATCTGTGC CAGCTGGGGG CACCCTTCTG CACCCTGGGC 
TTACT6TTGG CTCCTCCACC TGCTGTTCCC CCCGTGGGCC TGGGTGTGAG 
GAATACTGAC TCA6CCCTCT CTCTCTCTCT CTCCTCACCA GGGAAGCGTG 
TCTGCCTTGG AGAGGGCCTG GCAAAAGCGG AGCTCTTCCT CTTCTTCACC 
ACCATCCTAC AAGCCTTCTC CCTGGAGAGC CCGTGCCC6C CGGTACACCC 
TGAGCCTCAA GCCCACCGTC AGTGGCCTTT TCAACATTCC CCCAGCCTTC 
CAGCTGCAAG TCCGTCCCAC TGACCTTCAC TCCACCACGC AGACCAGATG 
AAGGAAGGCA ACTTGGAAGT GGTGGGTGCC CAGGACGGT6 CCTCCAGCCT 
CAACAGTGGG CATGGACAGG GTTAATGTCT CCAGAGTGTA CACTGCAGGC 
AGCCACATTT ACACGCCTGC AGTTGTTTTC CGGAGTCT6T CCCACGGCCC 
ACACGCTCAC TT6ACTCATG CTGCTAAGAT GCACAACCGC ACACCCATAC 
ACAACTACAA GGGCCACAAA GCAACTGCTG GGTTAGCTTT CCACAGACAT 
AAATATAGTC CATCTGCAAT CACAAGCACA TAGCCAGGTA ACCCACCAAC 
TCCCCTGGAT CTGCAGCCCA CACGTGGGAG TCTGGCTGTC ACCTTCACAA 
GCCACAGAAA CGGCCACACA TGTTCACAGC TCACACGCCC TCTCCATTCA 
TCGAACTTCT CAGTGTCCCT GTCCCTGGTG CCTGGCACAG GGAACAGCAT 
GCCCCCTCCG GGGTCATGCC ACCCAGAGAC TGTCGCTGTC TATGGCCCCA 
ACTCATGCTC CCTCTCTTGG CTACACCACT CTCCCAGCCT GTGACCACCG 
ATGTCCACAC ACCCCCAACC ACTTGTCCAC ACAGCTACCC ACGTACGACA 
TCGTCCTGGC TCCCCAGA6T ATCTTCCCAC TGAGACACGC CGCCCCCACA 
GAGGCACAGT CCCCAGCCAC CTCTGCAACT GCAGCCCTCA GTCACCCCTT 
TTTAAGCACC CTGATTCTAC CAAATGCAAA CACATCTGGG TCTGCGATTA 
TGCACA6AGA CTTTGGACAT ACGAGGACCC TCAGACCGGA GGAACACCTG 
CCCAACCCCA ACACGTGCTT ATGTAACCAC GT6GAAAGCG GCCCCTGCTG 
CCCCTCCACA CACACATACA CACTCACTGA TCTACAGCCC CTGTTCGGCG 
TCAGAGTCCC CACTAGACCC AGTGGAAGGG GTTAGAGACC AAGTAGGGGC 
CAGTTTCCAA TTCACCCTGT CAGG6AGTGA GCCGGATCTG ACGTTCCTTG 
TGACTTAAGG GTCCGGCTTG GGAATTAAAG TTTGTTTCTG GCCTTTA6CC 
TACTGCGTGT GTGACCCGTG TCAGTCACTG TGAGTAAGGG GTGGGGACAG 
GGGAGTCCAC CCCTCCCCTG AGGCTGGGCG GGAGCTGAAA AACATGGCCA 
CCGCCCACCC TGGCTGTTGA CATCAGGACC AGATGTGGAG CTGGGAGGAG 
GGGCAGGGCT GGTGACGCCC TGGGCCTCAT TTCCAAAAAG GGCCAAGGTG 
TCCGGCGGTG GGAAGTGGGC AAGGAGGGGG TAACCCAAGC TGGACT6TG6 
ACCTTGGGGG CTTCCTCA6C CAGGGAGAGC CTGAAGCCAA CTAGATCCAG 
ACCCTAGAGA CTCTTCAAAC TTGAGTACAG GAACTAGCTT GCAACACAGA 
CTCTAAGCCC ACTCCCATTT CTTCCACCCT TTTTCTCTTG CCTCCCCTTC 
ACAAGGAAA C CAGAGGCATT TGTAAATTTC CM IC I I II I I II I I 1 I I M 
TTTTTTTTGA GACGGAGTCT CACTCTGTCA CCCAGGCTGG AGTGCAGTGG 
TGTGATCTTG GCTCACTGCA GCCTCCGCCT CCGGGTTCAA GCCATTCTCC 
TGCCT CAGCC TCCCAAGTAG CTGGGATTAC AGGTGTGTGC CACCACGCCC 
AGCTAATTTT TGTAI M MA GTA6AGATGG GGTTTCACCA TGTTGGCCAG 
GCTGGTCTCG AACTCCTGAC CTCAGATGAT CTGCCAGTCT CGGCCTCCCA 
AAATGCTGGG ATTACAGGCG TGAGTCGCTA CTAGATAAAT TTCTTATCTA 
GCAAAGAAGT TTGCAAACAT ACGCAAAAGT AGAAAGATAC AATGAGCCCC 
CAGGTGCCCA TCACCCAGCC TCATTTCAAT AGTCATCAAC TTTCTGCAGC 
TTTTACTTCA TCTATATCCT TTTCTGCCTC TMMMMI TTTTTATTTT 
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GAGATAGGGT TTTGCTTTGT TGCCCAAGCT GGGGTGCAGT A6CATGATCT 
CATAGTTCAC TGGGGCTTCA GACTCCTAGG CTCAAGTGAT CCTCCCGCCT 
CGGCCTCCAA GCAGCTGGGA CTACAGATGC GTGCCACCAC ACCCAGCTAA 
ATTTCTTATT TTTATTTTCT ATAGAGAAAG TCTCACTATA CAGCCCTGTG 
CTGGTCTCAA ATTCCA6GCC TCAA6A6TTT CCATCCCAGC CTCCCAAAGT 
GCTGGGATTA TAGGC GTGAG TCACTGCACC CTGCCCTAAT A l l I I l ATTT 
TATCTATTGC TTTTTATTTA CTTATTTATC TTTTA I (HI GAGACAGAGT 
CTCACTCTGT GGCCCATGCT GGAGTGCAGT GGCATCATCT CGGCTCACTG 
TAACCTCCGC CTCTTAGGTT CAAGCA6TTC TCCTGCCTT6 ACCTCCCGAG 
TAGCTGGAAT TACAGGTGCC TGCCACCAAG CCTGGCTAAT I I I i I l ATTT 
GTAGTAGAGA TGGGGI IT IG CCATGTTGAC CAGGCTGGTC TCGAACTCCT 
GACCTCAGGT GATCTGCCCA CCTTGGCCTC CCAAAGTGCT GAGATTACTG 
GTATGAGCCA CCGT6CCTGG CCACCTATTG CTTTTTAAAG ATTA I III I i 
TATTATTATT ATTTTTTTAT TTGCAGATGG AGTTTCGCTA TTGTTGCCCA 
GGCTGGAGTG CAATGGCGTG ATCTCAGCTC ACCGCAACCT CCGCCTCCCA 
GGTTCAAGCG ATTCTCCTGC CTCAGCATCC CTAGTAGCTG GGATTACAGG 
CATGCACCAC CATGTCCAGC TAATTTTGTA I I I I lAGTAG AGACGAGGTT 
TCTCCAGGTT GGTCAGGCTG GTCTCAAACT CCCAACCTCA GGTGATCCGC 
CCATCTCGGC CTCCC AAAG T GCTGGGATTA CAG6TGTGAG CCACC6CGCC 
TGGCCTTAAA GATTATTTTA AGGCAAATTA CAGAAAGCAA TTTAAT6CAC 
ATTTCTGAGA GTTAAAGATA Ml ! I G CCCT TGACATTTTA TGAGGACAGT 
TTTCAAACAT GCAGCAAAGT TGAG66AATT GTACAAG6AA CACCTTGTGC 
ACTCCTGCCT C AGTCTC CCA AGCAGCTGGG ACTACAGGTG CCCGTCACCA 
CGCCTGGCTA ATTTTTTGTA II II lAGTAG AGATGGAGTT TCATCGTGTT 
AGCCAGGCTG ATCTCAAACT CCTGGCCTCA AGTGATCCAC CCGCCTCGGC 
CTCCCAAGTG CTGGGACTAC AGGCGTGAGC CACTGTGCCT GGCCAGGTAA 
GTAGACTCTT GATATTAGTT CTCTCTGGTT GAAATGTTTT TAAAAATGAA 
AGAGAATGAC TAATAACAAA AACACAGAAA GTTATAAGGA TTGATGAAGA 
TGTGGAGACT TT6AAACCCA TGTATACCAT TGGTGGGAAT GTGAAACGAC 
6CAGCCCTGT GGAAAATGGT ACAGCAGTTA CCTGA6GTCA GGAGTTTGAA 
ACCAACCTGG CCAACATGCA GAAACCCCGT CTCCATTAAA TGTACAAAAA 
TTAGCCAGGC ATGGTGGTGC GCACCTGTAA TCCCAGCTAC TCGGGAGGCT 
GAGGCA6GA6 AATT6CTTGA ACCCAGGAGG CGGAG6TTGC AGTGAGCCGA 
GATCGTGCCA CTGCACTCAG CCTGGGCAAC AAAGCAAGAC TCTGTCTCAA 
AAAAAAAAAG TCTACTTCCC AACCTTCCCA AAAATTTATC TAAACCCCGT 
GACAAAACTT TAACTTGTGT TTCCGACCCC AGGCTTGGCT GTTCTGGACA 
TTTACTTCCC AAAGGCTGTG TTCTCTCAGC CCCTCTGCCT GGTTTCTTTC 
AGGAGGAACA AAACCCAGGC ACAGAATTCA CCAACAAGAA CAT6CTGATG 
ACA6TCATTT ATTTGCTGTT TGCTGGGACG ATGACGGTCA GCACCACGGT 
CGG CTAT ACC CTCCTGCTCC TGAT6AAATA CCCTCATGTC CAAAGTAAGA 
GCCTTTTCCA CTTGCCAGGC CTTGGGAACA GAAGTCAGGG TTCTAGGCTG 
AGCAAGGTGG CTCACGCCTA TAATCCCAGC ACTTTGGGAG GCTGAGGC6G 
GCTGATCACT TGAGAATAGG AGTTTAAGAC CAGCCGGCCA ACACAGTGAA 
AC (SEQ ID NO: 3) 



1999 6349 
1999 2167 



2168 5236 

5237 5386 

5387 5919 

5920 6080 

6081 6166 

6167 6349 

6354 13542 

6354 6636 

6637 13087 

13088 13261 

13262 13441 

13442 13542 

13547 13648 

13547 13648 
13649 



Major Minor 

G C A 

C A G T 

A T G C 

A T 

T C G 

T C G 

T G A C 

A C T G 

C GAT 
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6385 C T G 

6813 C T A 

7853 C T 

12973 G C T 

12973 A G T 

13012 G ACT 

13072 A T C G 

13370 C G 

13682 C T A 

14631 C T G 



Context: 

DNA 

Position 

2226 TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 
TCCTCCCTACCCCCAGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTACCTGGGACC 
CTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGC CCTG GGAGGTCA 
GGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTT6ATGGCCA 
TGGTAAGTCAAGGGCTGCTAGGCCCTCCGCTCACAGCCTGCCACCACTTACTGGTGTGTG 
[G,C,A] 

CCTTTGCACATGGCTTAGTCCCTCTGTTGCCTCATCTGTCAAATGGAGTGATAACAGTGC 
CCATCAGCCGGGTGCAGTGGCTAGTGCCTGAAATCCCAACACTTTGGGAGGCGGAGGTGG 
GTGGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 
CTACTAAAAATATAAAAATTAGCTGGGCAT6GTGGTGCGTACCTGTAATCCCAGATACTT 
GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGATGTTGCAGTGAACCAAGA 

2226 TGACAGGGGCCATGATGGAGACACCTTGGATCGAAGAGGTCACAGCACCCTCCTCTTTCT 
TCCTCCCTACCCCCAGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTACCTGGGACC 
CTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGC CCTG GGAGGTCA 
GGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTTGATGGCCA 
TGGTAAGTCAAGGGCTGCTAGGCCCTCCGCTCACAGCCTGCCACCACTTACTGGTGTGTG 
[CA.GST] 
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CCTTTGCACATGGCTTAGTCCCTCTGTTGCCTCATCTGTCAAATGGAGTGATAACAGTGC 
CCATCAGCCGGGTGCA6TGGCTAGTGCCTGAAATCCCAACACTTTGGGAGGCGGAGGTGG 
GTGGATCACTTGAGGTCAGGAGTTCGAGACCAGCCrGGCCAACATGGTGAAACCCTGTCT 
CTACTAAAAATATAAAAATTAGCTGGGCATGGTGGTGCGTACCTGTAATCCCAGATACTT 
GGGAGGTTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCA6ATGTTGCAGTGAACCAAGA 

3081 AMCAGTTCTCTGTATrACAACAGAAAGCAGGAGGCCCATGCTGGGTGCTGCCAGGAACT 
CAGTAGTAACTAAGACAGCACCGGTGCTGCTTCCCCAGCGCACCTAGGCCAGTGGGGAAA 
CAGACTCACCACACAGTCCCAGCCCAGAGTGGTCAGGGCCAAGATGGGGAAGCACGGGGA 
GAAAGGTCAGGGTGGGATGGGGAGGGGTCAGGGCAAGAGGGGTCAGGGCCAGGCTGAGGG 
AAGCCCTGGGACTGTAGGAATTrAGAGGAGGTACCTGACCCGGCATGTTTGGTGAGGGAG 
[A,T,GSC] 

TTCAGGAAGTCTTCCTGGAAGAGAGGCTGTCGGAGCTGAGACTCATAAGATGAGTGGGGA 
GGGTGTTCCAGGCAGAAAGACCAGCACCTACAAAAGCATGACTTTGAGAGAAGCATTCAT 
CCATTCAACTGATGAATTTTCAGACTGGGCACGCTGGCTCATGCCTGTAATCCCAGCACT 
TTGGAAGGCTGAATGGGGAGGATGACTTGAGCCTAGGCATTTGTGACAAGCCTGGGCAAC 
ATGGTGAGACCCTGCCTCCACAAAACAAACAAACAAACAAAAAATCATTATACCTGGTAC 

3788 ATCCTAAAATGAGAGGTACAGATTTGAGAGCAAACACAGGGCACAGGCATATGTACGAGG 
GTAAAGAGGGAATCAGGGAAGGCTTCTCAGAGAAGGTGACATTTAAGCCGGGACATGAAG 
GATGAAC6AGTTAGTTCACCAAGGATGGGATGGAAAGGGGTGAGAGTGATGGAGGCAGAG 
GGAACT6CAGGATCATAGGCCTAGACAG6GGATCCTGACGCCCTTGAGGAAGTGAGAGAA 
GACCAGCGCAGTCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTC 
[A,T] 

TGCA GGGCCTGTGAGTCACGTCAGAGTGTTTGGGCTTTTGI I I I I CCTGGGAGCAGTCGA 
TTTTAAGCAGGGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGG 
GGATGAAACTGGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGC 
TGGGAGGTCCGCGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATT 
GGGAGAGGCCTGGGGCTCTGGCCGAGGAATGGATGGTGGGCTGAAACAGGGAGAGGAGAG 

3979 ATCATAGGCCTA6ACAGGGGATCCTGACGCCCTTGAGGAAGTGAGAGAAGACCAGCGCAG 
TCGTAGTGGGTTAAGTAACAAAGCTGAGAAGCCAGGGAAATCCCTGGTCATGCAGGGCCT 
GTGAGTCACGTCAGAGT6TTTGGGC I I I IGI II I ! CCTGGGAGCAGTCGATTTTAAGCAG 
GGAACAGCTGTATTCAGAGTTGGGAAGATCCTGTGGTTGCTGCCTGAAGGGGATGAAACT 
GGAGGCTAGGAGCCCAGGGTGATAGGGAGGATCCAGGGTGATGGGGAGGCTGGGAGGTCC 
[T,C.G] 

CGGTGATGGACCAGGGCTGGGGCCAGGGGATGGGGAGGAAGGAGTAATTGGGAGAGGCCT 
GGGGCTCTGGCCGAGGAATGGATGGTGGGCTGAAACAGGGAGAG6AGAGAT6CTTA6GCC 
ACTTTGGAACACAGTA6GGCAAGGACAGGAGACACCCAA66GGAAGTGCCCAAGAGACCA 
CGACAGGCTGGCATTGGACAGGGAAGGTCTGTCTGGAGCAGGTGTCTTGGATAAGGGAGG 
AAAATGGTGCAGTTCCATCCTCCTCCCTCTCTGTTCAACCTCTAAACTACATGGGGCACA 

5056 AGTTTGATTATGC^TTTATTGAGCACCTACTGAGTCCTCATCCCTGGGCTAGGCTGGAAT 
GGACTCAGATG6AGCCTGAAGAGTCCCCCTCAGGGAACCTCACTAGAAAGAAGGAGGAAT 
C6GCCGGGCGCGGTGGCTCACGCCTGTAATCCCAACACTTTGGGAG6CTGAGGTGGGTGG 
ATCACAAGGTCAGGAGATCGAGACCATCCTGGCTAACACAGTGAAACCCCATCTCTACTA 
AAAATACAAAAAATGAGCCAGGCATGGTGGCGGGCGCCTGTAGTCCCAGCTACTCAGGAG 
[T.C.G] 

CTGAGGCAGGA6AATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGACGAGATCACGC 
CACTGCACTCCAGCCTGGGCAACAGAGCGAGATTCCGTCTCAAAAAAAAAAAGAAAGAAA 
GG AAGAA GGGGGAATGGGGGAGAGGGGCCGGTCCC I II M GAGTCTAGCCTTCTGCGCAG 
GGGTTrTCTTCTCCAACGGGGAGCGGTGGAGGCAGCTGAGGAAGTTTACCATGCTTGCTC 
TGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGAGCTGATCCAGGCGGAGGCCCGGT 

5213 CTTTGGGAGGCTGAGGTGGGTGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCTAAC 
ACAGTGAAACCCCATCTCTACTAAAAATACAAAAAATGAGCCAGGCATGGTGGCGGGCGC 
CTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCC6GGAGGCAGA 
GGTTGCAGTGAGACGAGATCACGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGATTCC 
GTCTCAAAAAAAAAAAGAAAGAAAGGAA6AAGGGGGAATGGGGGAGAGGGGCCG6TCCCT 
[T,G,A,C] 

TTTGAG TCTA GCCTTCTGCGCAGGGGI I I I CTTCTCCAACGGGGAGCGGTGGAGGCAGCT 
GAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGA 
GCTGATCCAGGCGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGGTCAGCA 
TGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCrCCTACTGTGGCTG 
GGGGTGGCCCCAACCCAGGTCCTGGAATGGGCAGGAGGGGAAGCCTTGAACTCTAGGGCT 

5 508 TCCCI T I I rGAGTCTAGCCTTCTGCGCAGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAG 

GCAGCTGAGGAAGTTTACCATGCTTGCTCTGCGGGACCTGGGCATGGGGAAGCGAGAAGG 
CGAGGAGCTGATCCAG6CGGAGGCCCGGTGTCTGGTGGAGACATTCCAGGGGACAGAAGG 
TCAGCATGGCGGGGTCACCCCAGGGTCTCCAGCCGAGTGAAAGGGAAAACTCTCCTACTG 
TGGCTGGGGGT6GCCCCAACCCA6GTCCTGGAATGGGCAGGAGGG6AAGCCTTGAACTCT 
tA,C,T,G] 

GGGCTGGCCTGGGGGTTCTGTTCACTGCCACCTTCTGTCTCTGTCCCACTGTCTCTCCGA 
GGCTGTCATGACATCTCT CTGT GTGTCTCTGGTGCTATCATCCCATTCTTCCTGGGTCTC 
CATCTCTCTCTCTGTCTCTTTTCTrrCTCTCTCCTTTCCTCTATTTTTTGGGCCCT^^ 
CTAT CTCTGTTTCTGTCTCCCTGTCTGTGTGATGGTCACTCTGrrTCrTTCTCCCTC^ 
GTTTCTCTGTCCCTATCTGTCTGTATCCI ICII I GCCTGTTTAGCTCTCTCCCTGCGCTG 
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5857 CTGTCTCTCCGAGGCTOTCATGACATCTCTCTGTGTGTCTCTGCrrGCTATC^^ 

TTCCTGGGTCTCCATCTCTCTCTCTGTCTCI I I I CTTTCTCTCTCCTTTCCTCT A I 11 II 
TGGGCCCTCAGTCTATCTCrGTTTCTGTCTCCCTGTCTGTGT GATG GTCA CTCT GTTTCT 
TTCTCCCTGTCTGTTTCTCTGTCCCTATCTGTCTGTATCCTTCTTTGCCTGTTTAGCTCT 
CTCCCTGCGCTGTCCATCCATCTTTCCCTGCCTCCCTGTCTCTCTCTGGTTGGGTTCAGC 
[C,G,AST] 

CCAACCTGCTCCCCTCTGCCTGGCTCCATCACAGCCTACCTCCCTGCCCCCATTCCCCCC 
AGGACGCCCATTCGATCCCTCCCTGCTGCTGGCCCAGGCCACCrCCAACGTAGTCTGCTC 
CCTCCTCTTTGGCCTCCGCTTCTCCTATGAGGATAAGGAGTTCCAGGCCGTGGTCCGGGC 
AGCTGGTGGTACCCTGCTGGGAGTCAGCTCCCAGGGGGGTCAGGTGAGTGGGTGGGACCC 
CTCTCCA^CTACC^TCCCTGAAGGTTCCTGCCAAGGTCCCATGAGAACTAGCTGCCC^^ 

6385 GTGGGTGGGACCCCrCTCCAACTACCTTCCCTGAAGGTTCCTGCCAAGGTCCCATGAGAA 
CTAGCTGCCCTTCTCCCCACAGACCTACGAGATGTTCTCCTGGTTCCTGCGGCCCCTGCC 
AGGCCCCCACAAGCAGCTCCTCCACCACGTCAGCACCITGGCTGCCTTCACAGTCCGGCA 
GGTGCAGCAGCACCAGGGGAACCTGGATGCTTCGGGCCCCGCACGTGACCTTGTCGATGC 
CTTCCTGCTGAAGATGGCACAGGTGTGGGAAGGGTGCAGGGACCCCCTCTCTGAATGGGC 
[C.T.G] 

TGGT6ACCTGGCAGGTCCCAGCCAGGTGTCCCTGGGGACCTCAATTGGGTTCCTCTCTCT 
TTCTCTCTCTGCATGTCTCTGTGAGTATGAGTGTCTCTGTGCATGTGTGTGCATCCCTTC 
TCTGCACATCTGTGCTGGCCCTTTCAGGGCGTTGCTCTCACTGCCTCTCCCGCCCCCGAC 
CTGGGCATTTGTGCCGGGCTGTCT6TCTCTCCAGCATCTCTCCTCTTTCTCCCTCCCACC 
TCGGCCCTTGTGTTCAGGCCCCATGCCCAGG6TCCTACACCAGCAATCCCCAGGATCACT 

6813 ATCTGTGCTGGCCCTTTCAGGGCGTTGCTCTCACTGCCTCTCCCGCCCCCGACCTGGGCA 
TTTGTGCCGGGCTGTCTGTCTCTCCAGC^TCTCTCCTCTTTCTCCCTCCCACCTCGGCCC 
TTGTGTTCAGGCCCCATGCCCAGGGTCCTACACCAGOXAT CCCCA GGATCACrTCATCCC 
ATCCCCTGCAGCCTCCCCAGACTTTTATGTAAATTCACAATTTTATGTGAATTATGGTCA 
TTrATTAGGAAGCCTTGCAATATCAAGTTATGTTAATAAAGTCCACTTTATTAATTATAT 
[C.T,A] 

AGAACAATATTTCTTTTCCI I II II I I MCI II I CTTTTTAAAGAGACAGGATCTCTTTC 
TGTTGCCCAGGCTAGAGTACAGTTGCAAAATCATAGCTCACTGCAACCTTGAACTCCTGG 
GCTCAAGCAATCCTCCTGCCTCGGGCTCCTGAGTAGCTGGGACAACAGGTGTGCACCACC 
ACACCTGGCTAAAI I I I I I I I I I I I CTTTGTAGAGATAGACTCTCACTATGTTACCCAGG 
CTGGTCTTGAATTCCTGGGCTCATGTAATCCTCCTGCTGCCTTGAACrCCCAAAGTGCTG 

7853 TCTTTAATTAAATGCAGCCTCCCTGTTTGTGGAGAAA GAAAA AI II MCI lAACCCTATT 

GCCCCATTTCTTTTCTCTTTTATTGAATAI I! I I I AGTTTTAACTATAGTAAAATACACA 
TAACGTTTACCATCTTAACOVTTTTTAGGTATACAGTACAGTAGTGTTCAGTACATTCAT 
ACTGTTATGCAATCAGTCTCCAGAACTCTTCATGTTGCAAAGCT GAAACTCTATACC CAT 
TAAACAACTGCCTGTTCCTCCCTCCTCCAACCCCTGGCAATCACCI I I I I I I I I I I GAGA 
[C,T] 

GAAGTCTCACTCTGTCACCCAGGCTAGAGTGCGGTGGCTCGATCTCGGCTCACTGCAAGC 
TCCGCCTCCCGGGTTCATGCCATTCTCCTGCCTCAGTCTCCCAAGCAGCTGGGACTACAG 
GTGCCCGTCACCACGCCTGGCTAAI MM I GTATTTTTAGTAGAGATGGAGTTTCATCGT 
GTTAGCCAGGCTGATCTCAAACTCCTGGCCrCAAGTGATCCACCCGCCTCGGCCTCCCAA 
AGTGCTGGGACTACAGGCGTGAGCCACTGTGCCTGGCCAGGAAGTAGACTCTTGATATTA 

12973 CTGTCTTCCTCTCTTTCTCTCI MCI I I CTTGACAGGGTCTCTCTTTGTCTCCCAGGCTG 

GGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAACTCCTAGGCTCAAGTGATCCT 
CCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 
TTTTAATTTTTCGTAGAGACAAGGTCTTGTTATATTGCCCAGGCTGGTCTCAAACTCCTG 
GGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
[G,C,T] 

GCACGCCACTCAACACTCCACAAATGTTGATGCCATTATGTTTTGTGAACTAGTGT CCCT 
GGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCC 
TGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCTTCTCCTTAGGTATCTGCTGCA 
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12973 CTGTCTTCCTCTCTTTCTCT CI II C II I CTTGACAGGGTCTCTCTTTGTCTCCCAGGCTG 
GGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAACTCCTAGGCTCAAGTGATCCT 
CCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCATACCACCATGCCTGGCTAATT 
TTTTAAI I I i ICGTAGAGACAAGGTCTTGTTATATTGCCCAGGCTGGTCTCAAACTCCTG 
GGCrCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCTGGGATTACAGGCATGAGCCAC 
[A,G,T] 

GCACGCCACTCAACACTCCACAAATGTTGATGCCATTATGTTTTGTGAACTAGTGT CCCT 
GGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATAGACTGGGGTGGGAAGGGGTTT 
ATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCrrCCCCCTCC TTGGCT CCATCC 
TGCATGACCCCAAOVTCTTCAAGCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATG 
CAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCOTCTCCTTAGGTATCTGCTGCA 

13012 CTCTCrrTGTCTCCCAGGCTGGGGTGCAGTGGTACAAGCATAGCTCACAGCAGCCTTGAA 
CTCCTAGGCTCAAGTGATCCTCCCACGTCAGCCTCCTGAGCAGCTGGGACAACGGGCTCA 
TACCACCATGCCTGGCTAAI I II IIAAII II I CGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
GGGATTACAGGCATGAGCCACTGCACGCCACTCAACACTCCACAAATGTTGATGCCATTA 
[G,A,C ST] 

GTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTTGTACTCCACACTCGAGGACCAAATA 
GACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCT 
TCCCCCrCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 

13072 CTCCTAGGCTCAAGTGATCCTCCCACGTCAGCCrCCTGAGCAGCTGGGACAACGGGCTCA 
TACCACCATGCCTGGCTAAI I I I MAAI I I I I CGTAGAGACAAGGTCTTGTTATATTGCC 
CAGGCTGGTCTCAAACTCCTGGGCTCAAATGCTTCTCTCACCTCAGCCTCCCACGTGGCT 
G6GATTACAGGCATGAGCCACTGCACGCCACTCAACACTCCACAAATGTTGATGCCATTA 
TGTTTTGTGAACTAGTGTCCCTGGCACCCGAGACTT6TACTCCACACTCGAGGACCAAAT 
[A,T,CSG] 

GACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGGTCT 
TCCCCCTCCTTGGCTCCATCCTGCAT6ACCCCAACATCTTCAAGCACCCAGAAGAGTTCA 
ACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGCCCT 
TCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCAGGC 
ATCTGTGCCAGCT6GGGGCACCCTTCTGCACCCTGGGCTTACTGTTGGCTCCTCCACCTG 

13370 ATAGACTGGGGTGGGAAGGGGTTTATAGTTTCATTATTATTTCCCCTCAGGGCACGGAGG 
TCTTCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAAGCACCCAGAAGAGT 
TCAACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGAGGCGTTCCTGC 
CCTTCTCCTTAGGTATCTGCTGCAGCCCTGGGTATCACAAGCAGGTGCTGGCGAACTCCA 
GGCATCTGTGCCAGCTGGGGGCACCCTTCTGCACCCTGGGCTTACTGTTG6CTCCTCCAC 
[C,G] 

TGCTGTTCCCCCCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCT 
CTCCTCACCAGGGAAGCGTGTCT6CCTTGGAGAGGGCCTGGCAAAAGCGGAGCTCTTCCT 
CTTCTTCACCACCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCC 
TGAGCCTCAAGCCCACCGTCAGTGGCCI I I I CAACATTCCCCCAGCCTTCCAGCTGCAAG 
TCCGTCCCACTGACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGT 

13682 CCGTGGGCCTGGGTGTGAGGAATACTGACTCAGCCCTCTCTCTCTCTCTCTCCTCACCAG 
GGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGAGCTCTTCCTCTTCTTCACCA 
CCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGTACACCCTGAGCCTCAAG 
CCCACCGTCAGTGGC C I I I I CAACATTCCCCCAGCCTTCCAGCTGCAAGTCCGTCCCACT 
GACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGTGGTGGGTGCCC 
[C.T,A] 

GGACGGTGCCTCCAGCCTCAACAGTGGGCATGGACAGGGTTAATGTCTCCAGAGTGTACA 
CTGCAGGCAGCCACATTTACACGCCTGCAGTTGI I I I CCGGAGTCTGTCCCACGGCCCAC 
ACGCTCACTTGACTCATGCTGCTAAGATGCACAACCGCACACCCATACACAACTACAAGG 
GCCACAAAGCAACTGCTGGGTTAGCTTTCCACAGACATAAATATAGTCCATCTGCAATCA 
CAAGCACATAGCCAGGTAACCCACCAACTCCCCTGGATCTGCAGCCCACACGTGGGAGTC 
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14631 GCAGCCCTCAGTCACCCC I I I I I AAGCACCCTGATTCTACCAAATGCAAACACATCTGGG 



TCTGCGATTATGCACAGAGACTTTGGACATACGAGGACCCTCAGACCGGAGGAACACCTG 
CCCAACCCCAACACGTGCTTATGTAACCACGTG6AAAGCGGCCCCTGCTGCCCCTCCACA 
CACACATACACACrCACrGATCrACAGCCCCTGTTCGGCGTCAGAGTCCCCACTAGACCC 
AGTGGAAGGGGTTAGAGACCAAGTAGGGGCCAGTTTCCAATTCACCCTGTCAGGGAGTGA 



CCGGATCTGACGTTCCTTGTGACTTAAGGGTCCGGCTTGGGAATTAAACil I I Oil ICTGG 
CCTTTA6CCTACTGCGTGTGTGACCCGTGTCAGTCACTGTGAGTAAGGGGTGGGGACAGG 
GGAGTCCACCCCTCCCCT6AGGCT6GGCGGGAGCTGAAAAACATG6CCACCGCCCACCCT 
GGCTGTTGACATCAGGACCAGATGTGGAGCTGGGAGGAGGGGCAGGGCTGGTGACGCCCT 
GGGCCTCATTTCCAAAAAGGGCCAAGGTGTCCGGCGGTGGGAAGTGGGCAAGGAGGGGGT 



[C,T,G] 



Chromosomal Map Position 
ePCR to dbSTS 
Site (bases) Marker 
15155.. 15288 stSG46708 



Chr. 
19 



Organism 
Homo sapiens 
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